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Abstract — Recently, it has been shown that the max flow 
capacity can be achieved in a multicast network using network 
coding. In this paper, we propose and analyze a more realistic 
model for wireless random networks. We prove that the capacity 
of network coding for this model is concentrated around the 
expected value of its minimum cut. Furthermore, we establish 
upper and lower bounds for wireless nodes using Chernoff bound. 
Our experiments show that our theoretical predictions are well 
matched by simulation results. 

I. Introduction 

Traditionally, the information flow in networks is modeled 
as a multi-commodity flow problem by treating the underlying 
network as a flow network. Suppose that one source node in 
a graph has to transfer some information to one destination 
node (i.e., a unicast situation). By Menger's theorem [2], the 
maximum information that can flow is upper bounded by 
the value of the minimum cut between the source and the 
destination; this well-known result from classical graph theory 
is also known as the Max-flow Min-Cut theorem. One can 
use max-flow min-cut algorithms to compute the maximum 
throughput for instance for unicast, multicast, and multi- source 
multicast communications. 

Recently, Ahlswede, Cai, Li, and Yeung proposed in the 
seminal paper [1] a new paradigm, called network coding. 
Their key observation was that traditional store-and-forward 
networks cannot always achieve the max-flow value, whereas 
one can achieve this value using network coding. The idea is 
based on the simple fact that information can be replicated, 
mixed together and then transmitted over links to save band- 
width. If this is properly done, then the information can be 
reliably decoded at the receiver nodes, see e.g. [1], [14]. The 
basic idea of network coding is that the intermediate network 
nodes can now process, encode, and transmit information. 

Since its inception by Ahlswede et al., there has been an 
upsurge of interest in network coding, see for example [3]- 
[7], [9], [11] and the references therein. Arguably, most 
network coding publications model the underlying network 
as a directed acyclic graph and are typically concerned with 
solving single source multicast or multi-source multicast using 
deterministic or randomized encoding and decoding schemes. 

In this paper, we discuss a new model for wireless random 
networks. In this model, nodes are placed at random locations. 



Two nodes u and v are connected with probability 1 if the 
distance between them is less than or equal to r; the nodes 
are connected with probability p < 1 if the distance between 
them is less than or equal to R but greater than r; otherwise 
u and V are not connected. Thus, the model is a refinement of 
geometric random graphs that incorporates the potential loss of 
connectivity towards the end of the transmission range, where 
interference is more dominant. The main contributions of this 
paper are: 

• We introduce the quasi random geometric graph model, a 
model of wireless network topologies that simulates the 
connectivity in mobile ad-hoc networks more realistically 
than the random graph model, but is still easy to analyze. 

• We derive high-probability bounds for the network coding 
capacity of quasi random geometric graphs. 

• We provide simulations results that support our bounds 
on the network coding capacity. 

The rest of this paper is organized as follows. In Section II, 
we give an overview of network coding and the previous work 
in capacity of network coding. In Section III, we present our 
new model. We provide our main results in Sections IV and V. 

II. Background and Model Description 

In this section, we give a short summary of network coding, 
focusing on the calculation of the capacity of a min cut in a 
weighted random graph. For a more in depth discussion of 
basic concepts and methods of network coding, we refer the 
reader to the survey paper [3]. 

A. Network Coding Fundamentals 

To illustrate the power of network coding, we provide a 
simple example, which is often referred to as the Wheatstone 
bridge, due to its electrical circuits origin. It demonstrates that 
multicast routing can achieve the maximum possible through- 
put in a communication network using a coding scheme con- 
sisting of linear operations in finite field, whereas traditional 
store-and-forward routing cannot achieve the same throughput. 

Consider the example shown in Fig. 1(a), where the nodes 
X and Y respectively want to send two bits bi and 62 to each 
other. One way of doing this is to let the bit bi travel on the 
path X^A^B^Y dii one point of time and to let 62 
travel on the path Y ^ A ^ B ^ X on other. However, 



if the network wants to transmit the bits simultaneously, then 
there is no way to do so, as there are no disjoint paths between 
X and Y. 

However, using network coding as shown in Fig. 1(b), one 
can save bandwidth. In this case, both X and Y transmit the 
bits hi and 62 (as shown in the figure) and then A XORs 
{encodes) them together and the resulting bit hi 62 travels 
over the paths A ^ B ^ Y and A ^ B ^ X. Since node X 
already has of hi, it can recover {decode) 62 by the operation 
hi (61 62). Similarly Y can also decode hi. 

This example illustrates that the capacity of the minimum 
cut (equal to 1 in this example) can be easily achieved by 
network coding, whereas two rounds are needed to achieve the 
multicast in the uncoded (traditional) routing case, assuming 
unit capacity edges. Because of such benefits, network coding 
can be used in wireless ad-hoc networks or sensor networks to 
help conserve energy and to increase the overall throughput. 




Fig. 1. An example of network coding on a Wheatstone Bridge 

B. Network Coding in Ad-hoc Wireless Networks 

In [13], Ramamoorthy et al. modeled the capacities of the 
connected edges in a wireless network as a Weighted Random 
Geometric Graph {Q^^^^) and considered the single source 
multicast problem. 

Definition 1 (Single Source Multicast Problem): Let G — 
{V.,E) be a graph with vertex set V and edge set E rep- 
resenting a network. Let C F be a set of sources (origins) 
and T C ]/ be a set of terminals (destinations). The multicast 
problem is to distribute the messages from the senders s ^ S 
to all terminal nodes t e T, allowing routing along the edges 
of G. In network coding, the vertices are allowed to encode the 
incoming bits (or packets) and send encoded versions along 
the outgoing edges. A single source multicast problem is the 
special case where one has a single sender, that is, \S\ = 1. 

Ramamoorthy et al. extended the results proved by Karger 
et al. in [8] and used them to derive bounds for coding capacity 
for a single source multicast problem in a network comprised 
of a single source s, an intermediate network consisting of 
n relay nodes, and / terminal nodes, having independent 
and identically distributed link capacities ~ X between any 
two nodes. They showed that the network coding capacity is 
concentrated around the value nE[X] in such a network. 

In this paper, we extend their work to a more general and 
more realistic model that we call the Quasi Random Geometric 



Graph model (Q^^^^). We derive high-probability bounds for 
the network coding capacity of such graphs. 

III. Modeling Random Wireless Networks 

In this section, we present our new model and study the 
capacity of a minimum cut in a random wireless network. 

Let r be a real number in the range < r < 1. Recall that 
a Random Geometric Graph is a graph Q^^^ = {V^E) with 
n nodes selected independently and uniformly at random from 
the unit square [0, 1]^ in which any two nodes u and v inV are 
connected by an edge {u^v) in E if and only if the Euclidean 
distance d{u^v) < r. Such a graph is rough approximation of 
wireless networks. 

Random geometric graphs have been popular in wireless 
mobile ad-hoc networks literature, since it is a theoretical 
model of the network topology that is easy to analyze. How- 
ever, it does not realistically model the area of transmission, 
which is, in general, not a disk of radius r. Recently, a 
more realistic model for connectivity was proposed by Kuhn, 
Wattenhofer, and Zollinger [10]. In their model, two nodes u 
and V may or may not be connected when their Euclidean 
distance d{u^v) is within the range r < d{u^v) < r^ see 
Fig. 2. We use random instances of such quasi-disk graphs to 
model the dynamically changing network topology in wireless 
random ad-hoc networks. 




Fig. 2. The transmission range and Quasi Disk Graph Representation for a 
node 

Definition 2 (Quasi Random Geometric Graph (Q^^^^)): 
Let r and r' be two real numbers in the range < r < r' < 1. 
Let F be a set of n nodes that are selected independently and 
uniformly at random from the unit square [0, 1]^. If u and v 
are two nodes in V, then 

1) {u,v) e E if d{u,v) < r; 

2) lu,v) ^ E if d{u,v) > t'\ 

3) (ix, v) e E with probability p if r < d{u^ v) <r'. 

We call gQ^G^ = (V^E) a quasi random geometric graph. 

The difference between quasi random geometric graphs and 
random geometric graphs is that nodes at distance d within the 
range r < d < r' may or may not be connected; this models 
the connectivity in a more realistic way. 

Remark 3: Instead of having a fixed probability p for the 
connectivity of nodes within distance d in the range r < d < 
r\ one can use a function p{d) that associates a probability 



that depends on the distance to model the attenuation of the 
signal. Such a change is of course straightforward. We give 
one example in Section V. 

In this paper, we consider the problem of single- source 
multicasts in such quasi random geometric graphs. Our main 
concern is to provide a lower bound for the capacity of network 
coding in this situation. Before defining the capacity, we need 
to further detail our model of connectivity. 

Definition 4 (Connectivity Graph): Let s be a source node, 
T a set of terminal nodes, and R a set of relay nodes. We define 
a connectivity graph G = (V^E) as a graph with vertex set 
V = {s}U RUT such that G G QQ^^^; in particular, the 
vertices are located in a unit square. We assume further that 
the source node only sends messages and terminal nodes only 
receive messages; in particular, the source and terminal nodes 
do not relay any messages. Furthermore, we assume that the 
source and the terminal nodes do not communicate directly; 
thus, any message is routed through at least one relay node. 

We assume that the edges in the connectivity graph represent 
links with unit capacity. Put differently, we assume that the 
capacity Cij for i^j in V is given by 



I ^ 



if {iJ)eE, 

otherwise. 



We note that Cij = Cji, since the graph is undirected. 

Definition 5 (A Cut and its Capacity): Let G = (V^E) be 
a connectivity graph with source node 5, a set T of terminal 
nodes, and a set R of relay nodes such that V = {s} URUT. 
Let t be a terminal node in T. An s-t-cut of size k in the 
connectivity graph G is a partition of the set of relay nodes 
R into two sets Vk and Vk such that 

(i) \Vk\ = ksind\Vk\=n-k; 

(ii) R = VkUVk and VknVk = 0. 
The edges crossing the cut are given by 

1) En{{s,i)\ieVkh 

2) En{{j,t)\jeVk}; __ 

3) En{{j,i)\j eVk sindieVk}. 

In other words, the source node s and the relay nodes Vk are 
on one side of the cut, whereas the relay nodes Vk and the 
terminal node t on the other side of the cut. The total capacity 
of an s-t-cut of size k is given by 



Ck = "^Csi^ Yl Yl ^Y^i 



(1) 



jeVk ieVk 



IV. Bounds and Results 



In this section, we bound the network coding capacity of a 
connectivity graph, where the connections of the relay nodes 
form an instance of a quasi random geometric graph. 

Let G = {VjE)hQ 3. connectivity graph such that the vertex 
set V consists of a source node s, a set of terminal nodes 
T, and a set of relay nodes R, that is, V = {s} U T U R. 
Recall that two nodes u and v in G are connected by an 
edge with probability 1 if d{u,v) < r, with probability p if 
r < d{u^ v) < r^ and with probability otherwise. Therefore, 



the probability p' that two nodes u and v are connected can 
be bounded by 

^ {irr^ + 7r(r'^ - r^)p) < p' < irr^ + 7r(/^ - r^)j9. (2) 

The motivation for the lower bound stems from the fact that 
one of the nodes might be located in one of the corners of the 
unit square. The upper bound is a straightforward consequence 
of our connectivity rules. 

These elementary observations allow us to bound the ex- 
pected value of the cut Ck- By equation (1), we have 

= p'{n + k{n — k)). 

In particular, E[Cfe] = E[Cn-/c] holds for all k in the range 
{) < k < n. Furthermore, we have 

E[Co] = E[Cn] < E[Ci] = E[Cn-i] < • • • < nC\nm]- 

Our goal is to prove that the capacity Ck of an s-t-cwi is 
concentrated around its expected value. A technical difficulty 
arises because the edges between relay nodes in the graph G 
are in general not mutually independent. Indeed, if two relay 
nodes u and v are connected, and u is connected to yet another 
relay node then there is a good chance that v is connected 
to w. Put differently, we have 

Vi[{v,w) e E\{u,v) e E,{u,w) eE]> Vi[{v,w) e E], 

whence the three events {u^ v) G E, {u^ w) G E, and (v, w) G 
E are not independent. In Fig. 3, we sketch different geometric 
situations between two nodes; positioning a node w within the 
transmission range of u nicely illustrates the intuition behind 
this fact. 




Fig. 3. Consider two relay nodes u and v of G\pi. Subfigure (a) illustrates 
the situation when the two nodes are not connected and far apart. The 
other subfigures illustrate the following situations: (b) d{u^v) < r, (c) 
r < d(u, v) < r' , and (d) 2r < d{u^ v) < r -\- r' . 



However, certain edges in a connectivity graph are indepen- 
dent. Indeed, all edges that are incident with a fixed (common) 
vertex are independent, since the coordinates of the vertices 
in the underlying quasi geometric random graph are chosen 
independently and uniformly at random. Consequently, the 
random variables in the set {Cij \j G /}, where i is fixed, 
are independent. We will take advantage of this fact in our 



proof of the concentration result. To that end, recall Chemoff 's 
bound for sums of independent Bernoulli random variables. 

Lemma 6 ( Chernoff bound): Let Xi , . . . , be indepen- 
dent Bernoulli random variables such that Pr[X/c = I] = p' 
and Pr[Xfc = 0] = 1 - p\ and let X = ELi^/c- For 
< e < 1, we have 

Pr[X < (1 - e)E[X]] < g-^M^'/^ 

Proof: See, for instance, [12, p. 66] for a proof of this 
well-known bound. ■ 

Lemma 7: If Pr[X < a] < Pr[X < ^][jPr[Y < f]. 

Proof: This is because if X > a/2 and Y > a/2, 
definitely X + F > a. So if X + F < a, at least one of 
X and Y must be less than a/2. ■ 
This lemma is quite simple, but it turns out play a crucial role 
in the proof of following theorem. 

Theorem 8: For all cuts of size k, and all e, we have 



FT[Ck < (1 - e)E[Ck]] < e 



-0 



-ln(fc+l) 



Proof: Let Ck denote the capacity of an s-t-cut {{s} U 
Vfc; Vfe U {t}) in the connectivity graph. Here s is the source 
node, Hs a terminal node, and 14 U is a partitition of the 
relay nodes into two disjoint sets Vk and Vk that respectively 
have cardinality k = \Vk\ and n — k = \Vk\- 
We can reformulate equation (1) in the form 



c 



(3) 



So following lemma 7 and the above formula, if the event 



Cfe < (1 - e)E[Cfe] 



(4) 



happens, then at least one of the following k+1 simpler events 
must happen also 

(i) T.iev,Csi<{l-e)E[Ck]/{k + l), 

(ii) EieF.u{t} Cj. < (1 - e)E[Cu]/{k + 1), 

where j G Vk- Since the left hand side of (i) and (ii) are 
sums of independent Bernoulli random variables, we can use 
Lemma 6 to bound the probability of these events. Therefore, 
we obtain the estimate 

Pr[Cfc < (1 - e)E[Cfe]] 

< Pr[ ^ Csi < (1 - e)E[Ck\l{k + 1)] 

+ Ep^[ E Cj,<(l-e)E[Cky(k + l)] 
jeVk i€VkU{t} 

< Pr[ ^ Cs^ < (1 - e)(fc + l)(n - k)p'/{k + 1)] 

+ E (C,i<(l-e)(fc + l) 

jeVk ieVkU{t} 

(n-fe + iy/(fe + l))] 

< exp(-(n- A:yeV2) 

+ A:exp(-(n - A: + l)p'e'^/2) 

< exp(-((n - k)p'e^/2 - \ii{k + 1))). 



In the next two theorems, we are going to show that 
the capacity of a minimum cut is, with high probability, 
concentrated about the value np' = E[Co]. Intuitively, it 
is not surprising that the bottleneck is likely going to be 
the connection from the source to the relay nodes, so the 
dissemination of the information is likely to be limited. 

Theorem 9: Let G be a connectivity graph with one source 
node s, n relay nodes, and a set T of terminal nodes. Then, 
with probability 1 — 0{r/n'^), where r = |T|, the network 
coding capacity Cs^t of G is bounded from below by 



Cs,T > (1 - e)E[Co], where e = 



I 4 Inn 
p'{n — k) ' 



where p' satisfies (2). 

Proof: Let Cmin{s-, t) denote the capacity of a minimum 
s-t-cut Let us assume further that this minimum cut has size 
k, that is, Cmin{s,t) = Ck- Since E[C/c] > E[Co] holds for 
all k in the range < /c < n, we have 

Pr[C«,„(s,t) < (1 - e)E[Co]] < Pr[Cfe < (1 - e)E[Ck]] 
<Pr[|Cfe-E[Cfe]| >eE[Cfe]] 
< 2exp ((-(n - k)p'e^/2 + \n{k + 1))) , 

where the last inequality is due to Theorem 8. Substituting the 
value of e from the hypothesis yields 

Pr[Cmin < (1 - e)E[Co]] < 2exp(-21nn) 

= 0(l/n2). 

Consequently, the probability that the network coding capacity 
Cs,T will be below the value (1 — e)E[Co] can be bounded by 

Pr[C,,T< (1 - e)E[Co]] 



< Pr 



\J{Cmin{s,t) < (l-e)E[Co]) 



< ^Pr[C™„(s,t) < (l-e)E[Co]] 

teT 
= 0(r/n2), 

as claimed. ■ 
We complement the above lower bound by a high- 
probability upper bound on the network coding capacity. 

Theorem 10: Let G be a connectivity graph with one source 
node 5, n relay nodes, and a set T of terminal nodes. Then, 
with probability 1 — 0(l/n^/^), the network coding capacity 
Cs^T of G is bounded from above by 



C5,T < (l + e)E[Co], where e 



/ 41nn 



where p' satisfies (2). 

Proof: If the network coding capacity Cs^t exceeds the 
value (1 + e)E[Co], then the capacity of any 5-t-cut, for any 
teT, must exceed that value as well; in particular, the cut 
{{s};RUT) must have capacity exceeding (l + e)E[Co]. Since 



we assume that the source node is not directly connected to 
any terminal node, we obtain 



Pr[C,,T > (1 + 



e)E[Co]] 

Csr > (1 



)E[Co]: 



<Pr[IE.eflC'..-E[Co]| >eE[Co]] 

The indicator random variables Cgr, with r e R, are mutually 
independent, as the location of the relay nodes are indepen- 
dently and identically distributed in the unit square. Recall 
that the Chernoff bound for independent identically distributed 
indicator random variables Xi with Pr[X^ = 1] = is given 
by Pr[|^^^^Xi - np'\ > t] < 2exp{-t^/3np'). Applying 
this Chernoff bound to the indicator random variables Csr 
yields 

MEreRCsr-nCo]\>eE[Co]] 



< 2exp( 
= 2 exp I 



2E[Co]V(3E[Co])) 
41nn E[Co]^ 

E[Co] 3E[Co] 



which proves the claim. ■ 
Remark 11: Our results easily generalize to more general 
substrates of unit area (not just unit squares), as long as 
the assumption holds that the nodes are uniformly distributed 
over the area. The concentration results are not affected by 
such a change, but the connectivity probability p' might be 
dramatically different. For instance, if the area is a rectangle 
that is £ high and 1/e wide, then p' approaches as e 
approaches 0. 

V. Simulations and Experiments 

We conducted simulations for various instances of gQ^^^ 
using different parameters. Our simulation results support the 
high probability bounds on the network coding capacity given 
in Theorems 9 and 10. 

In a first experiment, we determined the minimum capacity 
of an s-t cut for different instances of a connectivity graph 
in qQ^^^ with a fixed number of nodes. Fig. 4 shows the 
results of such an experiment with n = 200 relay nodes. The 
radio transmission range is chosen such that within a radius 
of r = 0.1 the connectivity is guaranteed and up to a radius 
of = 0.2 one might get connected. The plot shows that the 
capacity of the network is concentrated around the expected 
value of 13 which is in agreement with Theorem 9 and 10 for 
the above values of n, r and r^ 

Fig. 5 shows the result of a second experiment. This time, 
the number of relay nodes is once again n = 200, but 
the transmission range is higher, namely the inner radius 
equals r = 0.13 and outer radius equals r' = 0.18. We 
generated random instances of gQ^^^ with these parameters 
and determined the minimum cut. One can easily see that the 
capacity of the network is likely to be higher, as expected. 

For larger n, we could observe that the histograms become 
more concentrated around the expected capacity of a minimum 
cut, as predicted by our theory. 
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Fig. 4. n=200, r=0.1, = 0.2 
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Fig. 5. n=200, r=0.13, r' = 0.18 



In a third series of experiments, we simulated the increase 
of capacity of the minimum cut for different values of r and 
n. In this case, we also modeled the connectivity probability 
as a decreasing function of distance, following Remark 3, 



p ■■ 



1 



Pconnection •> 



where d{i^j) is the Euclidean distance between any two 
nodes i and j such that r < \d{ij)\ < r\ and PconnecUon 
is a probability that accounts for the interference noise in 
communication. 

As it can be seen from Fig. 6, the value of the capacity 
grows more rapidly for lower values of r. This is intuitive 
because in that case not many nodes are connected for small 
values of n. As we increase n but keep r constant, the capacity 
of the minimum cut must increase, since more and more nodes 
are packed in the same area. 

VI. Conclusion 

We modeled a quasi wireless random network and showed 
that the capacity of the minimum cut of network coding is 
concentrated around the value np' = E[Co]. Unlike prior 




Fig. 6. The capacity of s-t minimum cuts with different values of n and r 



works, we obtained high probabiHty bounds for this model. 
More reaUstic models (for example, when the probability 
of connectivity drops exponentially with distance to account 
for signal attenuation) can be easily incorporated into our 
framework without changing the theory in a significant way. 
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